Mining texts to efficiently generate global data on political regime types
نویسندگان
چکیده
منابع مشابه
proposing a method to classify texts using data mining
today a significant part of available data is saved in text database or text documents. the most important thing is to organize these documents. one way to organize text documents is to classify them. to classify texts is to assign text documents to their actual categories. this has two main steps, i.e. feature- and learning algorithm selection. there have been several methods suggested to clas...
متن کاملEfficiently Mining Regional Outliers in Spatial Data
With the increasing availability of spatial data in many applications, spatial clustering and outlier detection has received a lot of attention in the database and data mining community. As a very prominent method, the spatial scan statistic finds a region that deviates (most) significantly from the entire dataset. In this paper, we introduce the novel problem of mining regional outliers in spa...
متن کاملEfficiently Mining Co-Location Rules on Interval Data
Four new p-aminoacetophenonic acids, named (2E)-11-(4′-aminophenyl)-5,9dihydroxy-4,6,8-trimethyl-11-oxo-undec-2-enoic acid (1), 9-(4′-aminophenyl)-3,7dihydroxy-2,4,6-trimethyl-9-oxo-nonoic acid (2), (2E)-11-(4′-aminophenyl)-5,9-O-cyclo4,6,8-trimethyl-11-oxo-undec-2-enoic acid (3) and 9-(4′-aminophenyl)-3,7-O-cyclo-2,4,6trimethyl-9-oxo-nonoic acid (4), were isolated from an endophyte Streptomyce...
متن کاملEfficiently Methods for Embedded Frequent Subtree Mining on Biological Data
As a technology based on database, statistics and AI, data mining provides biological research a useful information analyzing tool. The key factors which influence the performance of biological data mining approaches are the large-scale of biological data and the high similarities among patterns mined. In this paper, we present an efficient algorithm named IRTM for mining frequent subtrees embe...
متن کاملHandling Texts ? A Challenge for Data Mining
The amount of data in free form by far surpasses the structured records in databases in their number. However, standard learning algorithms require observations in the form of vectors given a fixed set of attributes. For texts, there is no such fixed set of attributes. The bag of words representation yields vectors with as many components as there are words in a language. Hence, the classificat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Research & Politics
سال: 2015
ISSN: 2053-1680,2053-1680
DOI: 10.1177/2053168015589217